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Abstract 

P . 

■ Efficient communication in wireless networks is typically challenged by the possibility of 

interference among several transmitting nodes. Much important research has been invested in 
decreasing the number of collisions in order to obtain faster algorithms for communication in 
such networks. 

This paper proposes a novel approach for wireless communication, which embraces collisions 
rather than avoiding them, over an additive channel. It introduces a coding technique called 
Bounded-Contention Coding (BCC) that allows collisions to be successfully decoded by the 
receiving nodes into the original transmissions and whose complexity depends on a bound on 
the contention among the transmitters. 

BCC enables deterministic local broadcast in a network with n nodes and at most a trans- 
mitters with information of £ bits each within 0(alogn + a£) bits of communication with 
I/-) | full-duplex radios, and 0((a log n + a£)(\ogn)) bits, with high probability, with half-duplex 

radios. When combined with random linear network coding, BCC gives global broadcast within 
0((D + a + log n)(a log n + £)) bits, with high probability. This also holds in dynamic networks 
that can change arbitrarily over time by a worst-case adversary. When no bound on the con- 
tention is given, it is shown how to probabilistically estimate it and obtain global broadcast that 
is adaptive to the true contention in the network. 
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1 Introduction 



Handling interference in wireless networks is a fundamental challenge in designing algorithms for 
efficient communication. When two devices that are near each other transmit at the same time the 
result is a collided signal. In order to enable the receivers to obtain the original information that 
was sent, thereby achieving efficient communication that allows the design of fast algorithms for 
wireless networks, much important research has been invested in scheduling the transmissions in a 
way that avoids collisions as much as possible. 

Avoiding collisions basically requires some type of symmetry breaking among the nodes that 
want to transmit, to prevent them from transmitting at the same time. Simple solutions like Time 
Division Multiple Access (TDMA), which assigns predetermined slots according to node IDs, are 
expensive in situations where not all of the nodes want to transmit, since their costs depend on 
the total number of nodes rather than on the number of actual transmitters. One can improve this 
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solution by allowing nodes that cannot interfere with each other to share a slot, so that the number 
of slots depends on the node degrees in the network graph rather than on the total number of nodes. 
However, this requires the nodes to have information regarding the topology of the network, and is 
still expensive in cases where the contention is less than the node degrees. One successful approach 
for avoiding collisions in wireless networks is to allow each node to use a schedule of probabilities 
to decide whether to transmit at each time slot [H[T0,28!. These algorithms typically guarantee a 
high probability of successful transmissions after some bounded number of attempts. 

In this paper we provide a coding framework for coping with collisions in a wireless commu- 
nication model abstraction called the finite-field additive radio network model, where a collision 
of transmissions optimally coded for an Additive White Gaussian Noise (AWGN) channel with 
multiple-user interference is represented to be equivalent to the element-wise XOR of a string of 
bits representing the original transmissions. More generally, collisions can be modelled as being 
equivalent to the sum, symbol-wise, of the elements of vectors over a finite field, where the trans- 
mission of a user is represented as a vector in that finite field, the XOR case being the special 
case where the field is ¥2. Such a model has been shown to be approximately valid in a high SNR 
(signal-to-noise ratio) regime, abstracting away the effect of noise in the channels and allowing us 
to concentrate on the theoretical aspects of the interference among transmissions. Such a model in 
effect replaces the traditional information-theoretic setting of Gaussian inputs for an AWGN chan- 
nel with an approximate finite algebraic construct [3j[24j- Such additive models have been shown, 
albeit without a finite field construct, to be effective in high SNR settings even in the absence of 
underlying capacity-achieving codes, for instance in such systems as zig-zag decoding |16j . which 
can be modelled algebraically [36] . 

In this additive model, our key observation is that only if not all messages are valid transmissions 
then a sum representing a collision might indeed be uniquely decodable. However, we do not wish to 
restrict the information the users may send in the wireless system. Instead, we propose encoding 
the information sent into restricted sets of signals that do allow unique decoding when they collide. 
A node receiving a collision can then uniquely decode the signal to obtain the original unrestricted 
messages. Clearly, for information-theoretic reasons, we cannot hope to restrict the transmissions 
without the cost of some overhead in the amount of information sent. The challenge, then, is to 
find codes that allow unique decoding in the above setting with the shortest possible codewords. 
Under our high SNR assumption, we consider both half-duplex (sometimes termed time-division 
duplex - TDD) and full-duplex channels. While the TDD model is by far the most common current 
mode of operation, high SNR conditions can allow full-duplex operation. 

1.1 Our contributions 

The Bounded-Contention Coding (BCC) Framework: We define a new class of codes, 
which are designed for settings in which the number of transmitters is bounded by a known constant 
a. Each such code consists of an individual code for each node, and has the desirable property 
that, when codewords from at most a different transmitting nodes are summed up, the result can 
be uniquely decoded into the original transmissions. This decoding process does not require that 
the nodes know the identities of the transmitters. Moreover, the active nodes may change from 
round to round. We show simple constructions of Bounded-Contention Codes, where the length of 
the codewords depends on both the known contention bound and the total number of nodes, but 
the dependency on the total number of nodes is only logarithmic. 
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Distributed computation using BCC: Using the new Bounded-Contention Coding technique, 
we show how to obtain local and global broadcast in both single-hop and multi-hop networks. BCC 
enables deterministic local broadcast in a network with n nodes and at most a transmitters with 
information of £ bits each within 0(a log n + a£) bits of communication with full-duplex radios, and 
0((a log n + a^)(logn)) bits, with high probability, with half-duplex radios. When combined with 
random linear network coding, BCC gives global broadcast within 0((D + a + log n) (a log n + £)) 
bits. These results also hold in highly dynamic networks that can change arbitrarily over time 
under the control of a worst-case adversary. 

Further, we show how to remove the assumption that the nodes know a bound a on the con- 
tention, by developing a method for handling unknown contention (or contention that varies over 
space and time), which is common in wireless networks. Also, while it may be reasonable to assume 
a bound on the contention, it is often the case that the actual contention is much smaller. 

1.2 Related work 

The finite-field additive radio network model of communication considered in this paper, where 
collisions result in an addition, over a finite field, of the transmitted signals, was previously studied 
in (3j [M], where the main attention was towards the capacity of the network, i.e., the amount 
of information that can be reliably transmitted in the network. While the proof of the validity 
of the approximation [3] is subtle, the intuition behind this work can be readily gleaned from a 
simple observation of the Cover- Wyner multiple access channel capacity region. Under high SNR 
regimes, the pentagon of the Cover- Wyner region can, in the limit, be decomposed into a rectangle, 
appended to a right isosceles triangle [23]. The square can be interpreted as the communication 
region given by the bits that do not interfere. Such bits do not require special attention. In the 
case where the SNRs at the receiver for the different users are the same, this rectangle vanishes. 
The triangular region is the same capacity region as for noise-free additive multiple access channel 
in a finite field |13| . leading naturally to an additive model over a finite field. 

Note that, while we consider an equivalent additive finite-field additive model, this does not 
mean our underlying physical network model is reliant on symbol-wise synchronization between 
the senders. Asynchrony among users does not affect the behavior of the underlying capacity 
region [20], on which the approximate model is predicated. Nor are users required to have the same 
received power in order to have the finite-filed equivalence hold - differences in received power simply 
lead to different shapes of the Cover- Wyner region, but the interpretation of the triangular and 
rectangular decomposition of the Cover- Wyner region is not affected. Moreover, our assumption of 
knowing the interfering users is fairly standard in multiple access wireless communications. Issues 
of synchronization, SNR determination and identification of users are in practice handled often 
jointly, since a signature serves for initial synchronization in acquiring the signal of a user, for 
measuring the received SNR and also for identification of the transmitting user. Finally note that, 
as long as we have appropriate coding, then the Cover- Wyner region represents the region not 
only for coordinated transmissions, but also for uncoordinated packetized transmissions, such as 
exemplified in the classical ALOHA scheme [34J. This result, which may seem counterintuitive, 
is due in effect to the fact that the system will be readily shown to be stable as long as the 
individual and sum rates of the Cover- Wyner region will exceed the absolute value of the derivative 
of an appropriately defined Lyapunov function based on the queue length of a packetized ALOHA 
system. 

There has been work on optimization of transmissions over the model of [3]. These ap- 
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proaches [2]]151l39j generally provide algorithms for code construction or for finding the maximum 
achievable rate, for multicast connections, over a high SNR network under the model of [3]. The 
approach of [14, 24, 25j considers a more general finite- field model and reduces the problem to an 
algebraic network coding problem [26]. Random code constructions, inspired from |19j are then 
with high probability optimal. These approaches differ from our work in this paper in that they 
are interested in throughput maximization in a static model rather than completion delay when 
multiple transmission rounds may occur. We are interested in the latter model and, in particular, 
in how long it takes to broadcast successfully a specific piece of information. 

There are many deterministic and randomized algorithms for scheduling transmissions in wire- 
less networks. They differ in some aspects of the model, such as whether the nodes can detect 
collision or cannot distinguish between a collision and silence, and whether the nodes know the en- 
tire network graph, or know only their neighbors, or do not have any such knowledge at all. Some 
papers that studied local broadcast are [H1E7], where deterministic algorithms were presented, 
and j6j,[2TJ[33] , which studied randomized algorithms. 

In the setting of a wireless network, deterministic global broadcast of a single message was 
studied in [TQHSIEH] , the best results given being O(nlogn) and 0(nlog 2 D), where D is the 
diameter of the network. Bar- Yehuda et al. [4] were the first to study randomized global broadcast 
algorithms. Kowalski and Pelc [28] and Czumaj and Rytter [TO] presented randomized solutions 
based on selecting sequences, with complexities of 0(D log + log 2 n). These algorithms match 
lower bounds of [H|32] but in a model that is weaker than the one addressed in this paper. The 
algorithms mentioned above are all for global broadcast of one message from a known source. For 
multiple messages, a deterministic algorithm for k messages with complexity 0(k log 3 n + n log 4 n) 
appears in [7j, while randomized global broadcast of multiple messages was studied in [5,22,23j. 
We refer the reader to an excellent survey on broadcasting in radio networks in |37j . 

Wireless networks are not always static; for example, nodes may fail, as a result of hardware or 
software malfunctions. Tolerating failed and recovered components is a basic challenge in decentral- 
ized systems because the changes to the network graph are not immediately known to nodes that 
are some distance away. Similarly, nodes may join and leave the network, or may simply be mobile. 
All of these cases result in changes to the network graph that affect communication. Depending on 
the assumptions, these changes can be quite arbitrary. Having a dynamic network graph imposes 
additional challenges on designing distributed algorithms for wireless networks. Dynamic networks 
have been studied in many papers. The problems addressed include determining the number of 
nodes in the network, gossiping messages, data aggregation, and distributed consensus [9| [29H3T] . 
For global broadcast, some papers assume restrictions on the changes made in each round. For ex- 
ample, [8] consider graph changes that are random. They also consider the worst-case adversary, as 
do the studies in (29j,[35] . In [29] collisions are abstracted away, so that edges of the network graph 
do not represent nodes that hear the transmissions, but nodes that actually obtain the message. 
In [17], the authors show how to use network coding in order to obtain more efficient algorithms 
for global broadcast in this dynamic model. 

2 Network Abstraction 

We consider a wireless network where the transmission of a node is received at all neighboring nodes, 
perhaps colliding with transmissions of other nodes. Formally, the network is represented by an 
undirected graph G = (V,E), where \V\ = n. We denote by N(u) the subset of V consisting of all 
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of it's neighbors in G and by D the diameter of the network. The network topology is unknown. 

We address two different radio models. One is the full-duplex model, in which nodes can listen 
to the channel while transmitting. The second is the half-duplex mode, in which at every time, 
a node can either transmit or listen to the channel. A transmission of a node v E V is modeled 
as a string of bits s v . The communication abstraction is such that the information received by a 
listening node u € V is equal to ©^gjyfw) where the operation © is the bit-wise XOR operation. 

The model is further assumed to be synchronous, that is, the nodes share a global clock, and a 
fixed slot length (typically 0(polylog n)) is allocated for transmission. 

Most of the paper assumes a bound a < n on the contention that is known to all nodes. However, 
the actual contention in the network, which we denote by a', may be even smaller than a. Each 
node has a unique ID from some set / of size |/| = N, such that N = n ^. 

3 Bounded-Contention Codes 

To extract information from collisions, we propose the following coding technique for basic Bounded- 
Contention Coding, in which each node encodes its message into a codeword that it transmits, in 
such a way that a collision admits only a single possibility for the set of messages that produced 
it. This enables unique decoding. 

Definition 1 An [M, m, a]-BCC-code is a set C C {0, l} m of size \C\ = M such that for any two 
subsets Si,S% C C (with Si / S2) of sizes \Si\, IS2I < a it holds that ®Si 7^ ©«S*2 ; where ® A~ of 
X = {x±, . . . , x~t} is the bit-wise XOR x,\ © • • • © x t . 

As a warm-up, we start by giving an example of a very simple BCC-code. This is the code of 
all unit vectors in {0, i.e., C = {xi\l < i < M,Xi(j) = 1 if and only if i = j}. It is easy to 
see that C is an [M, M, M]-BCC-code, since every subset S C C is of size s < M, and we have 
0S = i s , where x s (j) = 1 if and only if xi € S, implying that Definition [T] holds. 

The parameter M will correspond to the number of distinct transmissions possible throughout 
the network; for example, it could correspond to the number of nodes. The parameters a and m 
will correspond to contention and slot length, respectively. Therefore, the BCC-codes that interest 
us are those with m as small as possible, in order to minimize the amount of communication. The 
above simple code, although tolerating the largest value of a possible, has a very large codeword 
length. Hence, we are interested in finding BCC codes that trade off between a and m. To show that 
such good codes exist, we need the following basic background on linear codes. An [M,k,d] -linear 
code is a linear subspace of {0, 1} M , (any linear combination of codewords is also a codeword) of 
dimension k and minimum Hamming weight d (the Hamming weight of a codeword is the number 
of indexes in which the codeword differs from zero). The dual code of a linear code D, denoted 
D^, is the set of codewords that are orthogonal to all codewords of D, and is also a linear code. It 
holds that (D-*-) 1 - = D. The construction for arbitrary [M, m, a]-BCC-codes works as follows. 

BCC construction: Let D be a linear code of words with length M and Hamming weight 
at least 2a + 1. Let {x\, . . . ,x m } be a basis for the dual code D 1 - of D. Let H be the m x M 
parity-check matrix of D, i.e., the matrix whose rows are x±, . . . , x m , and let C be the set of columns 
of H. We claim that C is the desired BCC-code. 

Lemma 1 The code C constructed above is an [M,m,a]-BCC-code. 
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Proof of Lemma [1} It is clear from the construction that there are M codewords in C, each 
of length m. Assume that C is not an [M, m, a]-BCC-code. Then by Definition [H there are two 
subsets S\,S2 C C of sizes |£i|,|52| < a, respectively, such that ©Si = ©S^- Without loss of 
generality we can assume that S± n S2 = because adding the same codeword to both sets does 
not effect the equality of their XORs. This implies that © S% U S2 = 0. However, it is well known 
(see [38, Theorem 2.2]) that every set of d — 1 columns of H, where d is the minimum Hamming 
weight of D, is linearly independent In our case, d = 2a + l and \S\ US2I < 2a, which contradicts 
having © Si U S2 = 0, giving that C is an [M, m, a]-BCC-code. ■ 

As the following sections will show, we need [M, m, a]-BCC-codes with m as small as possible and a 
as large as possible. By Lemma [H this means we need to find linear codes of dimension k = M — m 
as large as possible and minimum Hamming weight d > 2a + 1 as large as possible. Note that we 
are only interested in the existence of good codes as the ID of a node will imply the codewords 
assigned to it, requiring no additional communication. 

Lemma 2 There is an [M,m,a]-BCC code with m = O(alogM). 

Proof of Lemma [2} The Gilbert-Varshamov bound [38J says that 2 k > , n -. for codes 

22j=o \ j) 

of length M, dimension k, and minimum Hamming distance d. This implies that k > M — 
log(d(^rf) rf X ) which is 0(M — dlog^f). In our notation, this gives m = 0(a log M). A greedy 
algorithm that repeatedly adds as a codeword an element of {0, 1} M that is not in a ball of distance 
d around any previously chosen codeword clearly attains the Gilbert-Varshamov bound. A slight 
modification also produces such a code that is also linear (see, e.g., [38, Theorem 4.4]). By LemmaHJ 
using this in the above BCC construction gives an [M, m, a]-BCC code with m = 0{a log M). ■ 

Lemma [2] implies, for example, that there are BCC-codes with a = 0(logM) and m = 0(log M ■ 
logM) = 0(log 2 M). As explained earlier, for solving the problem of local broadcast, the parame- 
ters a and m correspond to the contention and the transmission length, respectively. As we show 
in the next section, such BCC-codes with polylogarithmic parameters are well-suited for the case 
of bounded contention, hence we refer to them as Bounded-Contention Codes. 

In fact, the BCC-codes presented above are optimal, since Jl(alogM) is a lower bound for m. 
The reason for this is that each XOR needs to uniquely correspond to a subset of size at most a 
out of a set of size M. The number of such subsets is (^) , therefore each codeword needs to have 
length fi(]og (^)) = n(oIogM). 

4 Local Broadcast 

This section shows how to use BCC-codes for obtaining local broadcast in the additive radio 
network model. The simplest way to illustrate our technique is the following. Assume that in 
every neighborhood there are at most a participants, and each node needs to learn the IDs of all 
participants in its neighborhood. The nodes use an [N, a log N, a]-BCC code to encode their IDs 
and, since at most a nodes transmit in every neighborhood, every receiver is guaranteed to be able 

For every set SCC, let ys be the length M vector characterizing the set S, i.e., y~s(i) = 1 if and only if the i-th 
column is an element in 5*. Consider the multiplication of the matrix H by the vector ys- If H ■ j?s = then y is 
orthogonal to D ± , hence y 6 (D ± ) ± — D. This implies that the Hamming weight of y is at least 2a + 1. 
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to decode the received XOR into the set of local participants. For the case of a single-hop network, 
we show how this information can then be used in order to assign unique transmission slots for 
the participants. However, while this shows the simplicity of using BCC-codes for coping with 
collisions, it does not extend to multi-hop networks since these require more coordination in order 
to assign slots for interfering transmitters (who can be more than a single hop from one another, 
as in the case of a hidden terminal). Instead, we show how to use BCC-codes for directly coding 
the information rather than only the IDs of the transmitters. 

4.1 Single-hop Networks 

For the case of a single-hop network, the graph G is a complete graph: in each time slot every 
listening node receives an XOR of all the strings transmitted in this slot by all other nodes in the 
network. We assume a bound a on the contention, but our approach allows a slight improvement by 
being adaptive to the actual number of transmitters, a', rather than only depending on its bound, 
a. We do this by assigning a few slots for the purpose of just finding out which are the nodes that 
want to transmit and agreeing on an allocation of slots for them in which each transmitter v will 
transmit alone, guaranteeing that all other nodes receive its data s v successfully. We start with the 
full-duplex model, which means that nodes can listen and transmit at the same time. 

Algorithm 1 Local broadcast using BCC, code for transmitter u with data s u . 

1: Transmit codeword C(u) from an [N, a log N, a]-BCC code C, 

and receive c = C(v), where S € V is the set of nodes that transmit. 

2: Order nodes S by increasing IDs and transmit s u at your turn. 



Since this is a single-hop network, all nodes receive c = © u& g C(v), where S € V is the set of 
nodes that want to transmit. Recall that we are assuming only small contention in the network, 
which means formally that \S\ = a' < a. Since C is an [N, a log N, a]-BCC-code, c can be uniquely 
decoded into the set {C(v)\v G S}, and, more important, the set S can be uniquely identified by 
all nodes. This means that all nodes know after the first slot exactly who wants to transmit. This 
implies an agreement on the number of slots needed for each of the to transmit alone, as well as an 
order for their transmitters by increasing IDs. 

After sending the codewords, the transmitters actually go ahead with their transmissions ac- 
cording to the slot assignment, which means that a! < a slots are used, each of length t. This sums 
up to 0(a log n + a'£) bits. We can improve this even further by noticing that we do not need to 
bound in advance the length £ of the information transmitted. The reason for this is that each 
transmitting node now has its own assignment, and could transmit for an unbounded number of 
slots while ending its transmission with some predetermined signal, allowing the next transmitter 
to go ahead. This does not require more than a very small constant overhead in the length of the 
transmission, and it implies adaptivity in terms of the message length (as well as in terms of the 
actual contention). This gives the following result. 

Theorem 3 In a single-hop network with a' < a transmitters Vi, i € {1, . . . , a'} with information 
of l Vi bits each, Algorithm^ gives that every node receives all the information within 0(a(logn) + 

EiiAn) bits. 
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With half-duplex radios, we let each node choose whether it listens or transmits (if needed) 
with probability 1/2. This gives that for every message and every node v, in each round there 
is probability 1/4 for the message to be transmitted and heard by v. In expectation, a constant 
number of rounds is needed for v to hear any single message, and using a standard Chernoff bound 
implies that O(logn) rounds are needed with high probability. Finally, a union bound over all n 
nodes and all messages gives the following theorem. 

Theorem 4 In a single-hop network with a' < a transmitters Vi, i G {1, . . . , a'} with information 
of £ Vi bits each, the modification of Algorithm^ gives that every node receives all the information 
within 0(a(log 2 n) + 2i=i ^«t) bits, with high probability. 

4.2 Multi-hop Networks 

For multi-hop networks we first consider the same problem of local broadcast: every node, whether 
or not it is a transmitter, must receive all the messages sent by its neighbors. For this subsection, 
we assume that the bound a on the contention is a local bound, that is, there are at most a nodes 
in every set N(u) throughout the network that want to transmit. 

We assume the real data that a node v wants to transmit may be any element s G {0, 1} . 
Instead of encoding the IDs of the nodes in the network, we use an [iV2^, m, a]-BCC code C and 
every node v is assigned 2 e codewords {C(v,s)\s G {0,1}^} for it to use. This implies that the 
length of the codewords is m = 0(a log (N2 e )) = 0(a(logN + £)) = 0(a(logn + £)). Notice that 
this is optimal since a log n is required in order to distinguish subsets of size a among n nodes, and 
the a£ term cannot be avoided if a nodes transmit £ bits each. 

With half-duplex radios, we let each node choose whether it listens or transmits (if needed) 
with probability 1/2. This gives that for every message and every node v, in each round there 
is probability 1/4 for the message to be transmitted and heard by v. In expectation, a constant 
number of rounds is needed for v to hear any single message, and using a standard Chernoff bound 
implies that O(logn) rounds are needed with high probability. Finally, a union bound over all n 
nodes and all messages gives the following theorem. 

Theorem 5 In a multi-hop network with at most a transmitters with information of £ bits each in 
each N(u), local broadcast can be obtained within 0((a log n + at) log re) bits, with high probability. 

5 Global Broadcast 

In this section we show how to obtain global broadcast by combining BCC and network coding. We 
assume that at most a nodes have a message of £ bits each that needs to be received by all nodes of 
the network. We first briefly introduce random linear network coding (RLNC) as a solution to the 
global broadcast problem in additive radio networks and then, in Subsection 15.21 show how BCC 
can significantly reduce the coding coefficient overhead of RLNC when a « n. 

5.1 Random Linear Network Coding 

RLNC is a powerful method to achieve optimal global broadcast, in particular in distributed net- 
works in which nodes cannot easily coordinate to route information through the network. Instead of 
sending pieces of information around directly, RLNC communicates (random) linear combinations 
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of messages over a finite field F q . In this paper we will choose the field size q to be 2 which allows 
us to see vectors in F q simply as bit-vectors and linear combinations of vectors as XORs. 

We denote with m u € F2 the message sent out by node u and denote with S the set of at most a 
nodes that initially have a message. Given this, any packet sent out during the RLNC protocol has 
the form (//, Yluei ^« m «) e where \x € F 2 N is a coefficient vector indicating which messages are 

XOR-ed together in the second portion of a packet, i.e., a characterizing vector. We call packets of 
this form valid. A node u that initially starts with a message m u treats this message as if it received 
the packet (e u , m u ) before round one, where e u is the standard basis vector corresponding to u (that 
is, with a one at the coefficient corresponding to u, and zeros otherwise). During the protocol, each 
node that is supposed to send a packet takes all packets it has received so far and includes each of 
them independently with probability 1/2 in the new packet. The new packet is formed by taking 
the XORs of all packets selected in this way (if no packet is selected the node remains silent or 
alternatively sends the all zero vector). Nodes decode by using Gaussian elimination. This can 
be done if and only if a node has received a valid packets with linearly independent coefficient 
vectors. We note that, because of linearity, all initial packets and all packets created during the 
RLNC protocol are valid. More importantly, if multiple neighbors of a node send valid packets 
then the XOR of these packets which is received is also valid since the coefficient vectors and the 
message part XOR separately and component-wise. This makes RLNC a simple but powerful tool 
for exploiting the linear and additive nature of the additive radio networks we study in this paper. 

Algorithm 2 Global broadcast using RLNC, code for transmitter u with data s u . 

0: Initially, S u contains a single packet (e u ,m u ) 
1: FOR round i = 1, . . . , 32(£> + a + log N): 
2: With probability 1/2 DO: 

3: Listen to channel for N + 1 bits and update S u , the set of packets known to u 
4: OTHERWISE 

5: Send a packet (MjXXsS I^v^t-v), 

6: which is an XOR of a subset of packets in S u , each chosen with probability 1/2. 



We analyze the complexity of this RLNC scheme when used on top of an additive radio network. 
As in Section [H nodes can either transmit or listen to the channel at any given round since they 
have half-duplex radios. We use the above RLNC algorithm together with the strategy of choosing 
in each round whether to transmit or listen at random with probability 1/2. 

We show that the RLNC protocol achieves an optimal round complexity of 0(D + a + logn) 
with high probability. Our proof is based on the projection analysis technique from [18] but we 
give a simple, self-contained proof here. The reason that the analysis carries over from a message 
passing model to the radio networks considered here so easily is their additivity. In particular, we 
use the effect that the XOR of randomly selected packets sent out by several neighbors which get 
XORed in the air are equivalent to the XOR of a random selection of packets known to at least 
one neighbor. 

Theorem 6 RLNC disseminates alia messages, with high probability, in 0(D + a + log n) rounds 
in which messages of N + £ bits are sent in each round. 

Proof of Theorem [6l We say a node knows a coefficient vector fi £ FJy if it has received a 
packet with a coefficient vector that is non-perpendicular to \i (over GF(2)). We claim that for any 
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non-zero vector ^ the probability that any fixed node v does not learn within 0(D + a + logn) 
rounds is at most 2^ a+21ogn ). Then, a union bound over all nodes and all 2 a coefficient vectors 
in (i G Fg initially shows that, with high probability, all nodes know all vectors in the span of the 
messages given away initially. Finally, it is then easy to conclude that all nodes can decode. 

To prove this claim we look at a shortest path P from v to a node that initially knows ji (i.e., 
starts with a message with a non-zero coefficient in jjl). At any round t, let node u be the closest 
node to v on P that knows fi. There is a 1/2 chance that u sends and an independent 1/2 chance of 
1/2 that u sends out a packet with a coefficient vector that is non-perpendicular to u. Thus, in any 
round independently with probability at least \ knowledge of /U makes progress on P. Therefore, 
in 32(D + a + log n) rounds the expected number of rounds that make progress is 8(D + a + log n). 
A standard Chernoff bound shows that the probability that less than D progress is made in these 
32(D + a + logn) rounds is at most 2~ <a+21o s ri ) as claimed. ■ 

5.2 Reducing the Overhead of Random Linear Network Coding via BCC-Codes 

Note that ?? shows that RLNC has an essentially optimal round complexity. In particular, Q,(D) is 
a trivial lower bound since information passes at most one step in the network per round and 12(a) 
is a lower bound too since in each round at most i bits worth of information messages are received 
while at bits need to be learned in total. Lastly, the Q(logn) factor is tight for the proposed 
algorithm, too, because of the randomness used. On the other hand, the packets sent around have 
size N + 1 while carrying only t bits of information about the messages. Note that N > n and in 
many cases N = n c » i for some constant c which renders the standard RLNC implementation 
highly inefficient. 

The reason for this is that we use an N bit vector as a header to describe the set of IDs of 
nodes whose message is coded into the packet. This vector is always extremely sparse since at most 
n « N nodes are present and at most a « n nodes are sending a message. Instead of writing 
down a vector as is one could thus try to use a short representation of these sparse vectors. Writing 
down only the IDs of the non-zero components would be such a sparse representation (with almost 
optimal bit size a log N) but does not work here, because when multiple neighbors of a node send 
sparse coefficient vectors their received XOR cannot be uniquely decoded. BCC-codes solve exactly 
this problem, by providing a sparse vector representation: 

Definition 2 Let I be an ID set of size N and a be a sparseness parameter. Any [iV, a log N, a] - 
BCC code C mapping any ID u G / to C(u) G F 2 alogAf induces a sparse vector representation s 
that maps the vector /i G FJ? to s(u) = J2 u \fi u =i C{u). 

The following two properties make this representation so useful (in particular in this context): 

Lemma 7 Let I be an ID set of size N, a be a sparseness parameter and s be a sparse vector 
representation induced by a BCC-code C . For any two vectors n, u' G FJ? with at most a non-zero 
components we have: 

• Unique Decodability: /i/^' =^> s(fi) ^ s(/j,'). 

• Homomorphism under addition: s(u) + s( / u / ) = s(u + fi'). 
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Replacing the coefficient vectors \x in the RLNC scheme with their sparse representation leads 
to the much more efficient RLNC+BCC scheme. 

As in the RLNC protocol, we denote with m u € F% the message sent out by node u and denote 
with S the set of at most a nodes that initially have a message. Any packet sent out during the 
RLNC+BCC protocol has the form (/i, Yluei / u « m «) e p^ l °s N + ^ ^ pa\ogN - g & coc ^ 

coefficient vector indicating which messages are XOR-ed together in the second portion of a packet, 
i.e., it holds the XOR of the BCC-codewords of the IDs of the messages. As before, each node 
that is supposed to send a packet takes all packets it has received so far and includes each of them 
independently with probability 1/2 in the new packet. The new packet is formed by taking the 
XORs of all packets selected in this way, preceded by the corresponding coded coefficient vector. 
Nodes decode by using Gaussian elimination, which can be done if and only if a node has received 
a valid packets with linearly independent coefficient vectors. We note that, because of linearity, all 
initial packets and all packets created during the RLNC protocol are valid. Unlike having a list of 
IDs as a sparse representation of the coefficient vector, the power of BCC here is that if multiple 
neighbors of a node send valid packets then the XOR of these packets which is received is also valid 
since the BCC-coded coefficient vectors and the message part XOR separately and component- wise. 
Formally, the algorithm is identical to RLNC, except that the set S u of messages received by node 
u is initialized to (C(u),m u ) instead of (e u ,m u ), and the node listens for a log N + I bits, rather 
than N + i. 

Algorithm 3 Global broadcast using RLNC+BCC, code for transmitter u with data s u . 

0: Initially, S u contains a single packet (C(u),m u ) 

1: FOR round i = 1, . . . , 32(D + a + log N): 

2: With probability 1/2 DO: 

3: Listen to channel for a log N + £ bits and update S u , the set of packets known to u 

4: OTHERWISE 

5: Send a packet (//, ^2 v£Su fJ, v m v ), 

6: which is an XOR of a subset of packets in S u , each chosen with probability 1/2. 



Theorem 8 RLNC+BCC disseminates all a messages, with high probability, in 0(D + a + logn) 
rounds in which messages of 0(alogn + 1) bits are sent in each round. 

6 Dynamic Networks 

In this section, we consider the case of a highly-dynamic network with a worst-case adversary: in 
every round, that is, between the times nodes send packets, the network graph is determined by 
the adversary, which observes the entire computation so far when deciding upon the graph for the 
next round. Notice that all of the above results for local broadcast hold for such dynamic networks, 
given that a slot length is sufficiently long in order to contain the required information. This is, 
for example, 0(a(logn + £)) bits in the full-duplex model, which is reasonable to assume if a and 
I are not too large. We get absolute guarantees for local broadcast in this highly dynamic setting, 
while existing work on avoiding collisions in radio networks cannot achieve this since they require 
probabilistic transmissions. 
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Next, we generalize the RLNC+BBC framework for the case of this highly-dynamic network 
with a worst-case adversary. The only restriction is that the graph has to be connected in every 
round. The proof of the resulting theorem is essentially the same as for the static case but instead 
of arguing that every message makes progress over a shortest path P, we argue that it makes some 
progress since the graph is always connected. Hence D is replaced by n in the number of rounds 
needed. 

Theorem 9 In a dynamic additive radio network controlled by an adaptive adversary subject to 
the constraint that the network is connected at every round, RLNC+BCC achieves global broadcast 
of a messages, with high probability, in 0{n + a + log n) rounds using a packet size of 0(a log re + £) 
bits. 

Proof of Theorem [9l Let S be a set of a messages. We first analyze the algorithm given that a 
is known, and then use the same estimation technique as before to address the case of an unknown 
o. For a coefficient vector /i G we measure progress by counting how many nodes know about 
it. Clearly, initially at least one node knows about //, while we want all n nodes to know about it 
in the end. If in a round r this is not achieved yet, we claim that there is a probability of at least 
1/4 for at least one more node to learn /i. This is true because the graph is connected and hence 
there is at least one node v that knows \x which is connected to a node u that does not know \i. 
As before, the node u has a probability of at least 1/4 to learn [i in this round r. As in the proof 
of ??, a Chernoff bound shows that the probability that there were not enough (less than n — 1) 
such successes in 32(n + a + log n) rounds is at most 2~( a+logn ). Finally, a union bound over all 2 a 
vectors completes the proof. For an unknown a, the same estimation technique works here since 
we only require progress through some path to every node, which is promised since the graph is 
connected at every round. ■ 



7 Estimating the Contention 

We have given an almost optimal scheme for achieving global broadcast when the number of senders 
a (or a good upper bound on it) is known a priori. This assumption is not an unreasonable one, for 
example, if network statistics show such behavior. However, in many cases, local contention may 
differ at different places throughout the network, or vary over time. It may also be that the known 
bound is pessimistic, and the actual contention is much smaller than this bound. In this section, 
we show a method for removing this assumption by using BCC-codes to quickly determine a (and 
also reveal the identity of all senders). 

The mechanism we present allows for estimating the current contention and then using a code 
that corresponds to that estimate. A standard way to obtain a good estimation of contention is by 
having the nodes double a small initial guess until they succeed in local broadcast. In our BCC 
framework, the tricky part of this approach is to identify success. Specifically, using a bounded- 
contention code with parameter a for a set S of k > a transmitters may produce an XOR that is a 
valid XOR of some set S' of k' < a transmitters. Hence, the nodes need to be able to distinguish 
between such a case and the case where S' is the true set of transmitters. 

The idea behind our algorithm is simple. We use an [N, 2k log N, 2£;]-BCC code to send out IDs 
in every round to make them propagate through the network. Every node u keeps track of the set 
S u of all IDs it has heard from so far. In every round node u sends out an XOR in which each of 
the IDs in S u is independently included with probability 1/2. If k > a then nodes receive the sum 
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of at most a nodes in every round and are able to split this sum into IDs which are then added to 
their sets. This way an ID propagates from one node to the next with constant probability and we 
show that within 0(D + log re) rounds every node, with high probability, receives the ID of every 
node that wants to send. However, if k < a we may get XORs of more than 2k IDs, which have 
no unique decoding guarantees by the BCC-code. The following algorithm takes care of this by 
detecting such a case eventually (and sufficiently fast). 

Algorithm 4 Estimating the Contention a, pseudocode for node u. 
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k 4- 2 

REPEAT UNTIL fail u = false and \S U \ < k 
k -i— 2k 
fail u ^— false 

C 4- [N,2k\ogN,2k]-VCC code 
IF node u is a sender 

S u 4- {C(u)} 
ELSE 

FOR iteration i = 1, . . . , 32(D + log n): 



IF fail u 

send log re random bits 
ELSE 

listen for log re random bits 

IF received a non-zero string 
failu ^— true 
With probability 1/2 DO 

Send J2veS X v C(v) where X v are i.i.d. uniformly Bernoulli 
OTHERWISE 

listen for 2klogN bits 

IF what received can be decoded as X^es C(v) foic & \S\ < k 

Su 4 S u u s 

ELSE 

fail u <— true 



Theorem 10 With high probability, Algorithm [^] correctly identifies the subset of senders S at 
every node after a total amount of communication of 0((D + log n)(a log re)) bits. 

Proof of Theorem llOt We first show that at the end of each iteration in which k < a, with 
high probability, every node u has fail u = true or \S U \ > k. To show this, we argue that for every 
node u and every ID of a sender s, either fail u = true or C(s) £ S u . Initially, by Lines 6-7, this 
is true for the sender s itself. Let P be a shortest path from u to s, and let w be the closest node 
to u on P that has fail w = true or C(s) £ S w - If fail w = true then w sends on Line 13, and this 
indication of failure makes progress on P, with high probability (the probability for a zeros string 
is exponentially small in its logarithmic length). Otherwise, C(s) € S w , and there is a probability 
of 1/2 that w transmits and an independent probability of 1/2 that w sends out a packet with 
X s = 1 on Line 20. 
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Fix the set X of IDs x ^ s for which C(x) is included in the XOR that v, the neighbor of w 
along P, receives and consider the following cases. If X is decoded into a set of size larger than k 
or into a non- valid set then, with probability at least 1/4, C(s) is not included in the XOR and it 
is correctly decoded to produce fail v ^— true. Otherwise, X is decoded into a set of size at most k 
and then, with probability at least 1/4, C(s) is included in the XOR and v decodes a valid set of 
size at most k + 1. If the size is k + 1 then fail v <(— true and otherwise C(s) is added to S v . 

Thus, in any round, independently with probability at least 1/4, knowledge of C(s) or the 
indication of failure makes progress on P. Therefore, in 32(D + logn) rounds the expected number 
of rounds that make progress is 8(Z) + logn). A standard Chernoff bound shows that the probability 
that less than D progress is made in these 32(D + logn) rounds is at most 2" 21ogra as claimed. 
Then, a union bound over all n nodes and all k < n senders shows that, with high probability, 
every node u has fail u = true or S u > k. 

This implies that, with high probability, when k < a all nodes double their estimate and proceed 
to the next iteration. The same analysis shows that if the algorithm reaches an iteration in which 
k > a then the algorithm stops, with high probability, with each node having identified S, since a 
failure indication is never produced. The number of rounds in each iteration is 0(D + logn) and 
the number of bits per round in an iteration with estimate k is 0(klog N), which implies that the 
total number of bits communicated is J2k 0((D + log n)(fc logn)) = 0((D + log n)(a logn)). ■ 

One can use this procedure not just to estimate a but also to exploit the fact that it gives 
the IDs of all senders in order to simplify the RLNC algorithm. For this, we order the IDs of the 
senders and assign to the i highest node the ith. standard basis vector out of the space Fg . We then 
use this as a sparse and concise coefficient vector in the RLNC protocol. This gives the following: 

Corollary 11 After running the BCC-Estimation algorithm, RLNC achieves global broadcast, with 
high probability, in 0(D + a + logn) rounds in which packets of size a + i bits are sent in each 
round. 

8 Discussion 

This paper presents a coding technique for additive wireless networks, which allows efficient local 
and global broadcast given a bound on the amount of contention. It also shows how to estimate 
the contention when it is not known in advance. The results hold also for dynamic networks 
whose arbitrary changes are controlled by a worst-case adversary. For full-duplex radios, it gives a 
deterministic framework providing absolute guarantees. 

Directions for further research include using BCC-codes for solving additional distributed prob- 
lems in the additive wireless network model, and handling extensions to the model, such as noise 
and asynchrony. 
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